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Polynucleotide 

Present invention relates to a novel polynucleotide involved in heritable Parkinson's 
disease (PD), a novel polypeptide encoded by the polynucleotide, and a method for 
5 diagnosing heritable Parkinson's disease (PD). 

Background 

Parkinsonism (MIMl 68600) is a clinical syndrome characterized by bradykinesia, 
resting tremor, muscle rigidity, and postural instability (Gelb et al. 1999). The most 

10 common cause of parkinsonism is Parkinson's disease (PD). Second to Alzheimer's 
disease, PD is the most common neurodegenerative disorder affecting >1% of the 
population over 55 years of age (de Rijk et al. 1995). Neuropathological findings in PD are 
loss of pigmented neurons in the brainstem, substantia nigra and locus ceruleus, with 
intracellular Lewy body inclusions found within surviving neurons (Fomo 1996). 

1 5 Although PD is considered a sporadic disease, various hereditary forms of parkinsonism 
have been recognized (Vila and Przedborski 2004). A major breakthrough in recent years 
has been the mapping and cloning of a number of genes causing monogenic forms of 
parkinsonism. Genomic multiplication and missense mutations in the a-synuclein gene 
were initially identified in a small number of families with autosomal dominant 

20 parkinsonism (PARK 1/4 [MIM 168601]) (Polymeropoulos et al. 1997; Kruger et al. 1998: 
Singleton et al. 2003; Chartier-Harlin et al. 2004; Farrer et al. 2004; Zarranz et al. 2004). 
Subsequently, a-synuclein antibodies were found to robustly stain Lewy bodies and Lewy 
neurites in the substantia nigra in familial and sporadic PD (Spillantini et al. 1997) and 
common genetic variability in the a-synuclein promoter has been implicated in sporadic PD 

25 (Pals et al. 2004). 

Autosomal recessive mutations in three genes, parkin, DJ-I and PINKl have been linked 
with early-onset parkinsonism (<45 years at onset) (PARX2, PARK6 & PARK7 [MIM 
602533, 602544 & 608309])(Kitada et al. 1998; Bonifati et al. 2003; Valente et al. 2004). A 
large number of pathogenic mutations and rearrangements have been identified in the 

30 parkin gene reviewed by (Mata et al. 2004), but mutations in DJ-1 and PINK-1 are rare 
(unpublished data). 

Very recently, five pathogenic mutations were identified in a gene, leucine-rich repeat 
kinase 2 (LRRK2) in six families with autosomal-dominant parkinsonism, linked to the 
PARKS locus [MIM 607060]) (Zimprich et al. 2004a). Paisan-Ruiz and colleagues 
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independently confirmed these findings of two pathogenic mutations in a British and 
Basque families (Paisan-Ruiz et al. x2004). 

Object 

5 The object of the invention is to isolate a gene or pol>Tiucleotide proving inheritable 
parkinsonism, and to use presence of this gene to diagnose a patient before he/her gets sick. 
A further object is to use this gene or polynucleotide to transfect a microorganism or 
experimental animal in order to develop a new medicine for treating or preventing the onset 
of parkinsonism. 

10 

The invention 

InJieritablc parkinsonism may be proved by the method according to the characterizing 
part of claim 5, and the other objects are met by a polynucleotid according to the 
characterizing part of claim 1 , a recombinant vector according to claim 3, a DNA probe and 
1 5 a DNA primer according to claims 4 and 6 respectively, and a peptide according to claim 9. 

The inventors have isolated a novel LRRK2 mutation, and this mutation may cause 
development of dominantly inherited PD. By screening healthy persons, one can state 
whether the healthy persons have the mutation, and thus most likely will develop the 
20 illness. 

Using a probe to test whether a patient has the mutation allows a precise, differential 
diagnosis of this type of Parkinson's disease. Tlie probe represents a safe and accurate 
biomarker which will be powerful as it nominates subjects, future patients, for 
neuroprotective therapy. At the present time this is a research enterprise, but not for 

25 long. These subjects provide the first (and only) 'uniform substrate/background' for studies 
on drug efficacy/safety. From a research perspective they will also facilitate models of 
disease (C.elegans, Drosophila, mice) and epidemiological research on the variable 
expressivity and age-associated penetrance. As the sequence of the mutated gene is 
known, microorganisms and further experimental animals may be transfected, in 

30 order to investigate for a new medicine to treat or prevent the onset of the illness. 

ITie genetic information provides subjects with the cause of their disease, an explanation 
for which, if handled correctly, can be of great psychological benefit (fulfilling the 'need to 
know' why). This information also prioritizes the resources of the research community. 
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grant funding agencies and the pharmaceutical industry on developing a neuroprotective 
therapies to halt G2019S disease progression. 

In the following the invention will be described by reference to a study of PD patients 
5 and their families. Parts of the study are shown in figures, wherein 

Figure 1 shows a schematic drawing of LRRK2 with predicted protein domains, 

Figure 2 shows pedigrees of families with LRRK2 G2019S, 

Figure 3 shows chromosome 12ql2 STR markers on the disease haplotype (PARK 8), 
Figure 4 shows probability of becoming affected by parkinsonism, in LRRK2 G2019S 
10 carriers, as a function of age, and 

Figure 5 shows aligned amino acid sequences of the activation loop of different human 
kinases. 

Tlie inventors identified seven unrelated persons all having the new mutation, from 248 
1 5 multiplex kindreds with dominantly inherited PD, and six fiirther unrelated persons from 
three populaton-based series of persons with dominantly inhereted PD. These 13 persons 
and their families made basis for the inventors' further work. Segregation and linkage 
analysis provides evidence for pathogenicity and an estimate of age-associated penetrance; 
haplotype analysis demonstrates the mutation originates from a common and ancient 
20 founder. 

Subjects and Methods 

Study subjects 

The patients and controls were examined by neurologists specialized in movement 
25 disorders. A full history, including family history and neurological examination, was 

completed on each patient. Clinical diagnosis of PD required the presence of at least two of 
three cardinal signs (resting tremor, bradykinesia and rigidity), improvement from adequate 
dopaminergic therapy and the absence of atypical features or other causes of parkinsonism. 

30 LRRK2 sequencing and mutation screening 

Blood samples were taken and genomic DNA was extracted using standard techniques. 
Six families (families 194, 281, 3081, 3082, 3083 and 321 1) were known to have a positive 
LOD-score for STR (Short Tandem Repeat) markers in the PARKS locus (Zimprich et al. 
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2004b). Amplification of all 51 exons of the LRRK2 gene was performed by polymerase 
chain reaction (PGR) in one patient having PD, from each of these six families. All PCRs 
were carried out for each primer set with 20-50 ng of template DNA in a total volume of 
25^1 using a final reaction concentration of 200 jiM dNTP, Ix PCR-Buflfer (Qiagen), Ix Q- 
5 Solution (Qiagen), and 0.8 jiM of each primer. One unit of Taq polymerase (Qiagen) was 
added to each reaction. Amplification was performed using a 57-52'C-touchdown protocol 
over 38 cycles. The primers used for PGR amplification o(LRRK2 exons and for 
sequencing are available on request. 

The nucleotide sequences of all PGR products were determined by direct sequencing. 

10 Each PGR product was cleaned by using a Millipore PGR purification plate. Three 

microliters of purified PGR product was used per sequencing reaction with I nl of either the 
forward or reverse PGR primer and 1 ^1 of BigDye reaction mix (Applied Biosystems). 
Electrophoresis was performed imder standard conditions on an ABI 3730 automated 
sequencer (Applied Biosystems). All sequences were obtained with both forward and 

1 5 reverse primers. Sequences were analyzed with SeqScape software version 2.1 .1 (Applied 
Biosystems) and compared with published sequence of LRRK2 (GenBank accession no. 
AY792511). 

After identification of a heterozygous G20I9S (G6055A) mutation in the proband of 
family 321 5 (referred to as family 321 1 in Zimprich et al, 2004b), we designed a probe 

20 employing TaqMan chemistry on an AB17900 (Applied Biosystems) to screen for this 
mutation. First we examined 248 PD patients from families with a known family history, 
consistent with autosomal dominant transmission of a suspected causative gene. Then 377 
Norwegian, 271 Irish and 100 Polish PD patients (constituting the three population series) 
were checked using this assay; and 2260 samples of healthy persons fi'om similar 

25 populations were also included (1200 US American, 550 Norwegian, 330 Irish and 180 
Polish subjects), the latter to be used as control samples. Mutations were confirmed by 
direct sequencing of PGR products from LRRK2 exon 41 . Finally, all participating family 
members of LRRK2 G20 19 mutation carriers (affected and unaffected) were screened for 
the mutation. 

30 By 6055 G>A or G6055A it is meant that nucleotide number 6055 of the LRRK2 gene, 
counted fi-om the 5*end of the polynucleotide, has changed from G (guanine) to A 
(adenine). This change also causes a change in the polypeptide encoded by the 
polynucleotide, and G2019S denotes a polynucleotide where aminoacid number 2019 is 
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changed from G(Giycine) to S(Serine). These shortenings are wellknown to persons skilled 
of the art. 

Genotyping ofSTR markers 

5 Fourteen STR markers were genotyped in mutation carriers and all available family 
members, in all 13 families, for linkage analyses and to determine whether there was a 
particular haplotype associated with the LRRK2 mutation. STR markers were chosen to 
span the PARKS region including D12S87, D12S1648, D12S2080, D12S2194, D12S1048, 
D12S1301 and D12S1701. LRRK2 is located between D12S2194 and D12S1048. We also 
10 developed seven novel STR markers in this region (shown in table 1 below) by searching 
for repeat polymorphisms using RepeatMasker of in silica BAG sequence (UCSC Human 
Genome Browser Web site). The labeling of these novel markers reflects their physical 
position relative to the start codon of LRRK2. 

15 



Table 1. Novel chromosome 12 STR markers 


Marker name 


Primer sequence 


Physical position 
(bp) On 

chromosome 12 


D12S2514 


F: 5 '-11 GCAGCTGTAAGGAATTTGGG-3 ' 


38873779 




R: 5'-GCATTCTTCAGCCTGAGACCC-3' 




D12S2515 


F: 5'-TGAAGGACACTGAACAAGATGG-3' 


38974140 




R: 5'-GCCATAGTCCTTCCATAGTTCC-3' 




D12S2516 


F: 5'-CGCAGCGAGCATTGTACC-3' 


38989214 




R: 5'-CTCGGAAAGTTTCCCAATTC-3' 




D12S2518 


F: 5'-CTGGTATTACCTCAACTGTGGCTC-3' 


39034800 




R: 5'-ACTGGTATGTTTAAGCCTGGCAC-3' 




D12S2519 


F: 5'-AGCAGCAGAGAAGATTTCAATAAC-3' 


39116816 




R: 5'-AATCATCTTTGAAAGAACCAGG-3' 




D12S2523 


F: 5'-TAAACGAAGCTCCCTCACTGTAAG-3' 


39147728 




R: 5' - TCrn G rAGC'rGCGGTTGrrTC-3 ' 




DI2S2517 


F: 5'-TCATGAAGATGTCTGTGATAGGGC-3' 


39282976 




R: 5'-CTCTATTGTGAGCAAACTGCATGG-3' 
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One primer of each pair was labeled with a fluorescent tag. PGR reactions were canied 
out on 10-20 ng of DNA in a total volume of 15 |il with final reaction concentrations of 150 
|xM dNTP, Ix PCR-Buffcr (Qiagen), Ix Q-Solution (Qiagen) and 0.6 \iM of each primer, 
with 1 unit of Taq Polymerase (Qiagen). Amplification was performed using a 57-52X- 

5 touchdown protocol over 38 cycles. The PGR product for each marker was diluted by a 
factor of 10 to 100 with water. One microliter was then added to 10 |xl of Hi-Di Formamide 
and Rox size standard. All samples were run on an ABl 3 100 genetic analyzer, and results 
were analyzed using Genescan 3.7 and Genotyper 3.7 software (Applied Biosystems). 
Since population allele frequencies were not available from the CEPH database, these have 

10 been estimated by genotyping 95 unrelated Caucasian subjects, a population based series 
from the United States (shown in table 2 below). 



Table 2. Allele frequencies of Park 8 Markers 


Marker and allele (bp) 


Frequency (%) 


D12S87 (n = 92) 




150 


0.5 


154 


l.l 


156 


27.2 


158 


33.2 


160 


11.4 


162 


2.7 


164 


6.0 


166 


17.4 


168 


0.5 


D12S1648(n = 91) 




no 


13.7 


1 12 


3.3 


114 


11.0 


116 


4.4 


118 


2.2 


120 


2.8 


122 


17.0 


124 


3.9 


126 


7.7 


128 


14.3 


130 


8.8 


132 


2.8 


134 


2.8 


136 


1.7 


138 


0.6 


140 


2.2 


142 


l.l 


D12S2080 (n = 93) 
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Table 2. Allele frequencies of Park 8 Markers 


Marker and allele (bp) 


Frequency (%) 


134 


22.6 


138 


22.6 


140 


25.3 


D12S2520(N-85) 




248 


8.2 


251 


7.6 


254 


10,0 


257 


54.1 


260 


20.0 


D12S2521 (N = 93) 




311 


0.5 


315 


10.8 


319 


20.4 


323 


8.1 


327 


7.0 


331 


8. 1 


335 


0.5 


355 


i.l 


359 


7.5 


363 


13.4 


367 


7.0 


371 


7.0 


375 


6.5 


379 


3.8 


383 


1.1 


387 


.5 


D12S2522 (N = 93) 




281 


9.1 


283 


14.0 


285 


.5 


287 


1 1.3 


293 


.5 


295 


15.6 


297 


44.6 


299 


4.3 


DI2S2523 (n-89) 




305 


18.9 


314 


41.1 


317 


8.9 


320 


30.0 


323 


1.1 
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Table 2. Allele frequencies of Park 8 Markers 


(Marker and allele (bp) 


Frequency (%) 


DI2S2517(n = 93) 




180 


8.5 


182 


7.5 


184 


15.4 


186 


8.5 


188 


11.7 


190 


8.0 


192 


5,3 


194 


1.1 


196 


1.1 


198 


3.2 


200 


0.5 


202 


3.7 


204 


6.9 


206 


6.9 


208 


4.3 


210 


2.1 


212 


3.2 


214 


1.6 


216 


0.5 


Dl2S1048 (n = 89) 




211 


37.2 


214 


21.1 


217 


17.8 


220 


2.2 


223 


6.7 


226 


11.7 


229 


3.3 


DI2S1301 (n = 93) 




96 


0.5 


100 


37.2 


104 


17.6 


108 


11.1 


112 


12.2 


116 


13.3 


120 


7.5 


124 


0.5 
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Table 2. Allele frequencies of Park 8 Markers 


Marker and allele (bp) 


P'requency (%) 


DI2S170l{n = 93) 




89 


4.3 


91 


4.8 


93 


10.8 


95 


40.0 


97 


16.0 


99 


12.4 


101 


11.8 


103 


0.5 



A The number of individuals gcnotypcd is given for each marker (n) 
B Allc frequencies are for individual markers in U.S. control subjects 



5 Sun isiical A nalysLs 

Multipoint nonparametric LOD scores for all families were calculated using 

GENEHUNTER-PLUS (Kong and Cox 1997). The frequency of the deleterious allele was 

set at 0.0001, and empirically determined allele frequencies were employed. ITie map 

positions for each marker were taken from Rutgers combined linkage-physical map version 
10 1.0 (M AP-O-M AT web site). The three loci D 1 2S2080, D 1 2S2 1 94 and D 1 2S 1 30 1 are very 

tightly linked, with no observed recombinants in the database or within our genotyped 

families, and thus inter-marker distances were assigned as O.OlcM. 

Chromosome 12 haplotypes in the PARKS region were established for those families in 

which chromosome phase for mutation-carrying individuals could be deduced, thereby 
15 determining which alleles co-segregated with the LRRK2 G2019S mutation in each family. 

For those affected individuals in whom the associated allele for a marker could not be 

determined, both alleles are given. 

The age-dependent penetrance was estimated as the probability of a gene carrier 

becoming affected, at a given age, within the 13 families. The number of affected mutation 
20 carriers, for each decade, was divided by the total number of affected individuals, plus the 

number of unaffected carriers within that range. For some affected family members no 

DNA was available and only historical data on the disease course was obtained. These 

individuals were excluded from penetrance calculations. 

25 Results 

As mentioned previously, we identified 13 affected probands (i.e. 13 patients) who 
carry a heterozygous G6055A mutation in exon 41 of the LRRK2 gene. The mutation leads 
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lo a G2019S amino acid substitution of a highly conserved residue within the predicted 
activation loop of the MAPKKK (Mitogen- Activated Protein Kinase Kinase Kinase) 
domain (figure 1). After genotyping a total of 42 additional family members, 22 additional 
subjects were found to carry the mutation, seven with a diagnosis of PD (shown in table 3 

5 below). One affected member of family P-089 did not carry the mutation and, for the 
purposes of this study, was considered a phenocopy and excluded from further analyses. 
Seven families originated from Norway, three were from the United States, two from 
Ireland, and one was from Poland. One family from the United States descended from 
Russian/Rumania, and another from Italy. For only one family (family 111), the ethnic 

10 origin was unknown. The LRRK2 G2019S mutation segregates with disease in all kindreds, 
consistent with autosomal dominant transmission. To ensure patient confidenliality, 
simplified versions of the family pedigrees are presented in figure 2, There was no evidence 
of the mutation in the 2260 control samples. 

Age at onset of clinical symptoms was quite variable, even within the same family. Family 
15 1 1 20, a family from the United States, had both the earliest and latest age at onset for a 
patient. The youngest affected subject had an onset at 39 years, whereas the oldest carrier 
presented with initial symptoms at 78 years. Where recorded, most LRRK2 G2019S carriers 
have late-onset disease (>50 years at onset). The mean age at onset of affected mutation 
carriers was 56.8 years (range 39-78 years, n=19). Unaffected carriers have a mean age of 
20 53.9 years (range 26-74 years, n=14). The penetrance of the mutation was found to be 

highly age-dependent, increasing from 17% at the age of 50 to 85% at the age of 70 (figure 
4). 
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All affected members from the different families, except the individual in family P-089 
who did not carry the mutation, appear to share a common haplotype on chromosome 12 
the LRRK2 gene locus (figure 3). Haplotypes can be established with certaint)' in nine of 
the families, and all mutation carriers in these families share alleles for four STR markers 

5 and 4 single nucleotide polymorphisms (SNPs) in the LRRK2 gene locus. These markers 
are LRRK2 D 1 2S25 1 6, D 1 2S25 1 8, Dl 2S25 1 9, Dl 2S2520 and SNPs rs7966550, 
rsl427263, rsl 1 176013, rsll 564148. For the remaining families, the number of available 
samples from relatives was not sufficient to determine phase. However, the genotypes in 
these cases are consistent with a common LRRK2 G2019S allele. D12S2516 is located in 

10 intron 29 and D 1 2S25 1 8 is located in intron 44 of the LRRK2 gene, whereas the two other 
shared markers are positioned 3' of the gene. Using the physical position of the shared and 
non-shared markers, the size of tlie shared haplotype is between 145 kb and 154 kb. 



Discussion 

15 We have identified a novel LRRK2 mutation, G2019S, which co-segregates with 
autosomal dominant parkinsonism in 13 kindreds originating from several European 
populations. Positive LOD scores were obtained in multiplex families, and combined they 
provide significant support for the PARKS locus. LRRK2 G2019S mutation was absent in a 
large number of control subjects, and of similar ethnicity. The number of families linked to 

20 LRRK2 in this and previous studies now explains the majority of genetically defined 
autosomal dominant parkinsonism. 

The mean age at onset of affected LRRK2 G2019S carriers was 56.8 years, and 
comparable to that of patients in other families linked to PARKS (Funayama et al. 2002; 
Paisan-Ruiz el al. 2004; Zimprich et al. 2004a). The majority of patients present with late- 

25 onset disease, indistinguishable from typical idiopathic PD. Disease penetrance is age- 
dependent, and increases in a linear fashion from 1 7% at the age of 50 to 85% at the age of 
70. Age is the single most consistent risk factor for development of PD and other 
neurodegenerative disorders (Lang and Lozano 1998), and an important risk factor in 
LRRK2 associated parkinsonism. Interestingly, age at onset was variable in this study, both 

30 within and between different families, suggesting other susceptibility factors, 
environmental or genetic, may influence the phenotype. 

Although our findings clearly indicate that LRRK2 mutations account for a substantial 
proportion of familial late-onset parkinsonism, historically, cross-sectional twin studies 
have not supported a genetic etiology for late-onset PD (Tanner et al. 1999; Wirdefeldt et 
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al. 2004). The age-associated penetrance of LRRK2 mutations provides some explanation 
as even large and well designed twin studies arc underpowered to detect incompletely 
penetrant mutations (Simon et al. 2002). LRRK2 mutations were also found in apparently 
sporadic PD patients; three of the patients in this study did not have any known affected 
5 first- or second-degree relatives. However, a caveat of age-dependent penetrance is that 
carriers may die of other diseases, before manifesting or being diagnosed with PD. Thus, it 
seems difficult to separate sporadic and familial PD, or to hypothesize environmental 
causes to be more important in one group and genetic causes more prominent in the other. 
In light of these results, a family history of parkinsonism, previously considered an 

10 exclusion criterion for a diagnosis of PD, must be reconsidered (Hughes et al. 1992). 

LRRK2 is a member of the recently defined ROCO protein family (Bosgraaf and Van 
Haastert 2003). In human, mouse and rat, members of the ROCO protein family have five 
conser\'ed domains (figure 1). The kinase domain belongs to the MAPKKK subfamily of 
kinases. The active sites of all kinases are located in a cleft between an N-terminal and a C- 

1 5 terminal lobe, typically covered by an 'activation loop', in an inactive conformation. The 
activation loop must undergo crucial structural changes to allow access to peptide 
substrates and to orientate key catalytic amino acids (Huse and Kuriyan 2002). In different 
kinases, the activation loop starts and ends with the conserved residues asp-phe-gly (DFG) 
and ala-pro-glu (APE), respectively (Dibb et al. 2004). Of note, the LRRK2 G2019S 

20 substitution changes a highly conserved amino acid at the start of this loop (figure 5). In a 
German family we previously described, an I2020T mutation is located in an adjacent 
codon (Zimprich et al. 2004a). In other kinases, oncogenic mutations in residues within the 
activation loop of the kinase domain have an activating effect (Davies et al. 2002), thus we 
postulate LRRK2 G2019S and 12020T mutations may have an effect on its kinase activity. 

25 The age of an allele may be estimated fi-om the genetic variation among different copies 
(intra-allelic variation), or from its fi-equency (Slatkin and Rannala 2000). However, the 
local recombination rate on chromosome 12ql2 is unknovvTi, as is the frequency of the 
G2019S mutation in the general population. Nevertheless, at centromeres there is generally 
a dearth in recombination; indeed no crossovers have been observed between LRRK2 

30 fianking markers D 1 2S2 1 94 and D 1 2S 1 048 in our studies, or within CEPH families (MAP- 
0-MAT web site). The physical size of the shared haplotype is also small, between 145 kb 
and 154 kb. and the allele is widespread in families fi-om several European populations. 
Hence, the mutation is likely to be ancient and may be relatively common in specific 
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populations. These data suggest a substantial proportion of late-onset PD will have a 
genetic basis. 

5 Electronic-Database Information 

The physical position of markers is from NCBI build 34. Accession numbers and URLs for 
data presented herein are as follows: 

Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim/ 
1 0 M AP-O-MAT, http://compgen.rutgers.edu/mapomal 
RepeatMasker, http://www.repeatraasker.org/ 
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